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(N ■ Abstract 

In this paper, we propose and study a new semi-random model for graph partitioning problems. We 
believe that it captures many properties of real-world instances. The model is more flexible than the 
semi-random model of Feige and Kilian and planted random model of Bui, Chaudhuri, Leighton and 
Sipser. 

We develop a general framework for solving semi-random instances and apply it to several problems 
of interest. We present constant factor bi-criteria approximation algorithms for semi-random instances of 
the Balanced Cut, Multicut, Min Uncut, Sparsest Cut and Small Set Expansion problems. We also show 
how to almost recover the optimal solution if the instance satisfies an additional expanding condition. 
Our algorithms work in a wider range of parameters than most algorithms for previously studied random 
and semi-random models. 

Additionally, we study a new planted algebraic expander model and develop constant factor bi-criteria 
approximation algorithms for graph partitioning problems in this model. 
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cn 1 1 Introduction 

(N 

' 1.1 Overview 

in ' 

Graph partitioning problems are among the most fundamental problems in combinatorial optimization. They 
have numerous applications in science and engineering. They are also used as basic building blocks in 
many combinatorial algorithms. There has been extensive research on graph partitioning problems, which 
has been mostly focused on analyzing the worst case performance of optimization algorithms. Over the 
last two decades, poly-logarithmic approximation algorithms were developed for such fundamental graph 
partitioning problems as Minimum Bisection BH, Balanced Cut (220, Multicut EH, Min Uncut ll22llT1l. 
Yet, there has been little success in obtaining constant factor approximation algorithms for these problems, 
and some recent results ll27l l28l l35l l36l suggest that this may even be hard, assuming the Unique Games 
conjecture ||26l and its variants. 

However, real-world instances of graph partitioning problems are very different from worst case in- 
stances. To take advantage of this, many attempts have been made lfT3l [T71 [T2l l25l [T6l [l9l l32l [Toll to model 
average instances from practice and design algorithms that perform well in these models. The principal 
question now is — can we come up with a model, which on the one hand reasonably captures instances that 
often come up in practice, and on the other hand, leads to the development of new, interesting algorithms 
with good approximation guarantees in this model? 

Moreover, if we were to believe that these basic graph partitioning problems were hard in worst-case 
it would be ideal to have a distribution of "hard" instances (as in random 3-SAT lfT8l . planted clique H, 
or densest /e-subgraph [9]) that we can use as a test-bed for new algorithms. Further, in certain cases like 
the shortest vector problem on lattices ||2] [3] and the densest fc-subgraph problem [9 |, algorithms for an 
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appropriate average-case distribution of instances have led to new insights for better algorithms in the worst 
case. 

In this paper, we propose and study a new semi-random model for graph partitioning problems, which in 
our opinion captures many properties of real-world instances. We develop a general framework for analyzing 
semi-random instances of graph partitioning problems, and then present bi-criteria constant factor approxi- 
mation algorithms for the "classical" problems of Balanced Cut, Sparsest Cut, Multicut and Min Uncut as 
well as for the Small Set Expansion problem (a problem which has recently attracted a lot of attention). 

Before we proceed with the formal presentation of our model, let us discuss what we can reasonably 
assume about real-world instances. In a graph partitioning problem, the goal is to divide graph vertices 
into several parts, or clusters, so as to minimize the number of cut edges (subject to constraints that depend 
on a specific problem). When a practitioner solves a graph partitioning problem, she usually expects that 
the problem has a good solution — she believes that there is some underlying reason why there should be 
very few edges between clusters. That is, a real-world process that "generates" the graph instance adds an 
edge between clusters only when some random unexpected event happens. Therefore, in our opinion, it 
is reasonable to assume that edges between clusters are added at random. However, we cannot in general 
assume anything about edges within clusters (since their absence or presence does not affect the size of the 
cut between clusters). Additionally, in our model we assume that some random edges between cluster might 
be removed by the adversary (this assumption makes the model more robust). One could also view these 
edges between the clusters as random noise in an otherwise perfect clustering (partitioning). 

This discussion leads to the following informal definition of semi -random instances: consider a set of 
vertices V and some clustering of V. A semi-random graph G on V is a graph with arbitrary (adversarial) 
edges inside clusters and random edges between clusters (more generally, the set of edges between clusters 
might be a subset of a random set of edges). 

Consider a toy example that illustrates why we believe that real-world instances are well described by 
our model. Suppose that we run a wiki website (or online store, online catalog etc). We track what pages 
our visitors read and construct a graph G on the set of all wiki pages V (see e.g., Il33ll24l ). If a visitor goes 
from page A to page B, we connect A and B with an edge. What is the structure of this graph? We expect 
that a visitor will read one article, then read an article that explains some term mentioned in the first one, 
then read another article related to the second one and so on. Sometimes, of course, the visitor will move to 
a completely unrelated article on a different subject. Consequently, there will be two types of edges in our 
graph — edges between pages on the same subject, and edges between pages on different subjects. Edges 
of the first type are not random and show real connections between related articles. However, edges of the 
second type are essentially random. Say, an edge between articles "Ravioli" and "Register Allocation" is 
likely to be completely random and does not show any connection between articles; it just happened that the 
visitor first read an article about ravioli and then decided to read an article on register allocation; in contrast, 
an edge between articles "Ravioli" and "Dumplings" is not random and shows a real connection between 
these food items. To summarize, in our example 

• edges between pages on one subject are not random (i.e. edges within a cluster); 

• edges between pages on different subjects are random (i.e. edges between clusters). 

So G is a semi-random graph according to our model. 

Our semi-random model and results. Now we are ready to give a formal definition. To be more specific, 
let us focus on the Balanced Cut problem. 

Definition 1.1. (Semi-random Model for Balanced Cut) We are given a set V of n vertices, and a 
parameter e. In our model, a semi-random graph G is generated as follows. 

1. The adversary chooses a subset S C Vofn/2 vertices. 
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2. The nature chooses a set of random edges Er between S and V \ S and adds edges from Er to G. 
For every u G S and v G V\S, the edge (u, v) belongs to Er with probability e; choices for all edges 
(u, v) are independent. 

3. The adversary arbitrarily adds edges within S and within V \ S. 

4. The adversary deletes some edges between S and V \S. 

Aim: The performance of the algorithm is measured by comparing the cost of edges cut to the expected 
number of edges in Er (the set of edges chosen at step 2). 

Note that the guarantees are not w.r.t the size of the cut (S, V \ S) after step 4 or with the size of the 
optimal balanced cut. This is essential, since for example for e = 1, Er = S x (V \S) the adversary can 
choose any graph G; so if we compared the cost of the cut with the cost of the optimal cut, our model would 
be the worst case model. 

Informal Theorem. Given a semi-random instance, our algorithm finds a balanced cut (S' , V \ S') with 
\S'\,\V \ S'\ = O(n) of cost 0(\E R \) = 0(en 2 ) with high probability if 

e > \/log n(log log n) 2 /n. 

Informal Theorem.Given a semi-random instance, our algorithm finds a solution to the Small Set Expansion 
problem i.e., a subset S C V of size pn, of cost 0(\Er\) = 0(epn 2 ) with high probability if 

ep> \/log n log(l/ p) (log log n) 2 jn. 

Such results also hold for other basic graph partitioning problems like the Minimum Multicut, Sparsest 
Cut and Min Uncut (the complementary problem to MaxCut). The algorithm for the Small Set Expansion is 
not only interesting on its own, but can also be used to almost recover the original balanced cut under certain 
conditions. See Section [3] for a formal statement of the results. We remark that as e decreases, the problem 
becomes more challenging since the amount of randomness in the instances decreases. 

Note that the algorithm does not necessarily find the planted cut (S, V \ S) since in general this is 
impossible. Indeed the adversary can just delete all edges between S and V \ S and obtain an empty graph, 
or she can add every edge within S and within V\S with probability e and obtain a random G(n,e) graph. 
In either case, our algorithm has no information about the planted cut (£*, V \S). 

However, if we assume that graphs induced by S and by V \ S are combinatorial expanders, we can 
almost recover sets S and V \ S. This assumption is reminiscent of the stability assumption of Balcan, 
Blum and Gupta |7] for clustering problems (we need the extra condition on semi-randomness though). 
This assumption for planted partitioning problems can be justified in the implicit belief that approximately 
optimal solutions are close to the planted partition. 

Informal Theorem. There is a constant C > 1, such that for every constant r/ > 0, given a semi-random in- 
stance G with combinatorial expansion h(G[S]), h(G[V\S]) > Cenander/ > ylog n log ( 1 Irj) (log log n) 2 /n, 
our algorithm finds with high probability the partition (S, V \S) up to an error of±r]n vertices. 

A similar result also holds for the Small Set Expansion problem. 

Planted Spectral Expander Model. In this paper, we also develop bi-criteria approximation algorithms 
for graph partitioning problems on graphs with a planted spectral expander subgraph. Consider a graph G 
with a (planted) balanced cut (S, V \ S). Assume that the normalized algebraic expansion of the induced 
graph G[S] is greater than the combinatorial expansion h^ S y\ s ^ of the cut by some constant factor. Then 
our algorithm finds a balanced cut with expansion 0(h^s ,V\S))- (Note that we do not impose any restrictions 
on the graph G[V \ S] and on edges in the cut (S, V \ S); this result also applies to the case when G[S] is 
a random graph with the appropriate parameters.) We obtain a similar result for the Small Set Expansion 
problem. See Section [7]for details. 
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1.2 Prior Research 



Our work extends prior research on random and semi-random instances of graph partitioning problems. The 
first random model, the planted random model, was introduced in 1984 by Bui, Chaudhuri, Leighton and 
Sipser lfl3l . In this model, we generate a graph on a set V of size n as follows. First, we randomly choose a 
subset S of size n/2. Then we sample every edge between S and V \ S with probability e\, and every edge 
within S and every edge within V\S with probability £2 > £\. Note that all choices in the planted random 
model are random (there are no adversarial choices), so the model describes a probability distribution on 
graphs. The model attracted a lot of attention and was studied in a series of papers by Dyer and Frieze 11171 . 
Boppana [12], Jerrum and Sorkin 11251 . Dimitriou and Impagliazzo |[T6ll . Condon and Kaip [15] and Coja- 
Oghlan |[T4l . These papers explored several techniques for solving the problem — flow -based, combinatorial, 
spectral techniques, simulated annealing and go-with-the-leader technique. The algorithm of Boppana lfl2l 
finds the planted bisection (S, V \ S) w.h.p. if 82 — £1 > Cwe^ log n/n. Later McSherry [32 1 obtained 
similar results for a more general class of graph partitioning problems. 

Coja-Oghlan lfl4l extended the result of Boppana to the case when £2 — £1 > CQ- + yj E2 log (71-62) Jn). 
Note that if £ 2 — S\ = o(y / £2 log n/n) then the random graph has exponentially many minimum bisections 
and the planted bisection is not a minimum bisection w.h.p. fBI . The algorithm of Coja-Oghlan finds a 
minimum bisection rather than the planted bisection w.h.p. 

In 2000, Feige and Kilian fl9l proposed a more flexible semi-random model. The model adds an extra 
post-processing step to the random planted model: after a random graph is generated, the adversary may 
delete edges between S and V \ S and add new edges within S and within V \ S. Semi-random instances 
of Feige and Kilian can have much more structure than random planted instances. Therefore, the model 
arguably captures real-world instances much better than the random model. From an algorithmic point of 
view, an important difference is that algorithms for the semi-random model of Feige and Kilian cannot overly 
exploit statistical properties of random graphs. In particular, spectral algorithms do not work for this model. 
Feige and Kilian [19] developed an SDP algorithm that finds the planted bisection if £2— £1 > C\/^2 log n/n 
(matching the bound of Boppana lfl2l0 . 

In our semi-random model, the adversary has more power than in the model of Feige and Kilian. As in 
their model, the adversary can remove edges between S and V \ S but additionally she has absolute control 
over induced graphs G[S] and G[V\S] (whereas in the model of Feige and Kilian, she could only add extra 
edges to random G(f , £2) subgraphs inside G[S] and G[V \ S]). 

Our algorithm for Balanced Cut, while designed for a more general model, also works in a wider range of 
parameters than the algorithms of Boppana, and Feige and Kilian (note that the objective of our algorithms 
is slightly different, particularly we do not aim to recover the original partition precisely). To compare the 
algorithms, let us assume that probabilities s± and £2 are of the same order of magnitude, e\ = 0(e) and 
£2 = 0(e). While the algorithm of Bopanna[12] and Feige and Kilian [19] require that s > Clogn/n, we 
require only that e > C-y/log n(log log n) 2 /n. (The algorithm of Coja-Oghlan works in even wider range of 
parameters; in the planted random model, it finds an optimal bisection, different from the planted partition, 
when £ > C In.) 

Other Related Research. Previously semi-random models for other combinatorial problems were stud- 
ied by Blum and Spencer ifTTI . Feige and Kilian [ 19], Feige and Krauthgamer |[20l . and Kolla, Makarychev 
and Makarychev |29l . Recently, Balcan, Blum and Gupta [7] (for clustering) and Bilu and Linial [10] (for 
MaxCut) investigated another very interesting model for real-world instances. They suggested that real- 
world instances are stable — there is a unique optimal solution S, and every solution that is far away from 
S is much more expensive then S. Their results however are not comparable with results for the planted 
random model, semi-random model of Feige and Kilian, and our results. 

Guruswami and Sinop [23] recently presented approximation algorithms for partitioning problems on 
graphs with good spectral expansion. Their results (as well as results based on Cheeger's inequality) are not 
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applicable to semi-random instances of Balanced Cut. A semi -random adversary can choose k vertices in S 
and remove all (or almost all) edges incident to them from the graph, making the the first k eigenvalues of the 
Laplacian equal to zero (or close to zero). Moreover, even if the adversary does not modify a random planted 
graph, spectral algorithms based on Cheeger's inequality give only a trivial bound: the second eigenvalue 
of the normalized Laplacian of a random planted graph is A2 ~ £1/62- Therefore, a 0(A2)-approximation 
algorithm finds a cut of cost Q{e2n 2 ), which is far from the cost of the optimal cut (it is actually within a 
constant factor of the cost of the worst/typical balanced cut in G). 

Comparison of Techniques. Our approach was influenced by a recent paper of Kolla, Makarychev and 
Makarychev |29l on semi-random instances of Unique Games. In particular, we use Crude SDPs and the cut- 
long-edges method that were introduced in ll29l . However, from the technical standpoint, this paper and ll29l 
are very different and our algorithms require several new ideas . At high level, the algorithm of |29l (for the 
random edges adversarial constraints model) in one step finds a set of edges (constraints) E~ that contains 
almost all corrupted edges, and then processes E~ . This technique does not work with graph partitioning 
problems since we can find only a set E~ that contains a constant fraction of random edges (edges from Er) 
in one step. In this paper, we have to iteratively solve an SDP or C-SDP program, and remove "long edges" 
at different scales in order to find almost all random edges. Moreover, we cannot just use the technique 
of ll29l at each iteration by a number of reasons. Firstly, the argument of ll29l inherently works only at one 
length scale (a constant scale); if we apply it at the same scale over and over, we will not make any progress. 
Secondly, the algorithm of [29 ] needs the set of edges between S and V \ S to be random, but this set is no 
longer random after the first iteration (it depends on choices of the algorithm that in turn depend on Er). 
Finally, in order to find almost all edges from Er we have to cut "long edges" at smaller and smaller scales 
8p Each time we have to charge the number of cut edges to the cost of the SDP solution. Then the cost 
OPT /5t incurred at iteration t will grow as St goes to and the total cost will significantly exceed OPT. 

In this paper, we develop a technique of hidden solution sparsification that overcomes these difficulties. 
We design a procedure that at each iteration divides the graph into several pieces so that (i) the cost of 
the SDP solution in one of them is much smaller than OPT (the condition is more involved for the C-SDP 
solution), and roughly speaking (ii) all other pieces do not have to be further partitioned. Then we recursively 
apply the algorithm only to the first set. The hidden solution sparsification technique is the main technical 
and conceptual contribution of our paper. We briefly discuss it in Section [T31 of the Introduction. 

It seems that existing algorithms for random and semi-random instances of graph partitioning problems 
cannot be adapted to our semi-random model, since they make too strong assumptions about their input 
graphs, which are not true in our model. In particular, the algorithm of Feige and Kilian crucially uses that 
the cost of the optimal SDP solution for a semi-random instance of the Balanced Cut problem exactly equals 
the cost of the cut (S, V \ S). In our model, this is not the case — the cost of the SDP solution can be two 
times smaller than the cost of the cut. 

1.3 Our Techniques 

In this paper, we develop a general framework for solving semi-random instances of graph partitioning 
problems. 

Let us give a very brief and informal outline of our approach. The core of our algorithms is a hidden 
solution sparsification step (HSS). This step is the same in all our algorithms. Intuitively, the goal of this 
step is to find and remove almost all edges from Er (edges between clusters) by removing at most 0{OPT) 
edges. More specifically, the HSS step finds a set of edges E~ and divides the graph G — E~ into a set M 
and a number of sets Z\ such that: 

1. The cost of the optimal solution for the sub-instance on G[M] — E~ is at most OPT/polylog(n). 

2. Roughly speaking, each Zj does not have to be further partitioned. Formally, we call this condition 
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$ -feasibility. Say, for the Balanced Cut problem, this condition means that each set Z$ contains at 
most cn vertices (for c < 1); for Multicut, it means that each Z\ contains at most one terminal from 
each source terminal pair. 

3. All edges between M and Zi and between sets Zi lie in E~ . 

4. There are "few" edges in E~ . For Balanced Cut and Multicut, we require that \E~\ < 0(OPT); for 
Small Set Expansion we have a more involved condition. 

Then we run an existing polylog(n) -approximation algorithm for the sub-instance on the graph G[M] — E~ 
(e.g. run the algorithm of Arora, Rao and Vazirani ||6] for Balanced Cut). First condition guarantees, that 
the algorithm finds a partition {Mi} of M of cost 0(OPT). We consider the combined partition {Mi, Zj} 
of V. When we solve Balanced Cut or Multicut, the total number of edges cut by this partition is 0(OPT). 
We join together some sets in {Mi, Zj} and obtain a feasible solution of cost 0(OPT) (this step depends 
on the problem at hand). When we solve Small Set Expansion, we get a weaker guarantee on the set E~, so 
the cost of {Mi, Zj} might be very high. We use an extra post-processing step to find a subset of edges in 
E~ that we really need to cut. 

Hidden Solution Sparsification. For simplicity, let us focus now on the Balanced Cut or Multicut 
problem. We find the partition {M, Zi} as follows. We start with the trivial partition M = V and then 
iteratively cut sets Zi from M. Once we cut a set Zi, we do not further subdivide it. We ensure that after t 
rounds the cost of the optimal solution for the sub-instance on G[M] — E~ is 0(OPT '/2* ) and that properties 
(2)-(4) hold. Then after O (log log n) iterations, we get the desired partitioning. 

At iteration t, we solve the SDP relaxation for the problem on G[M] — E~ and obtain an SDP solution 
ip : M — > M. n (the solution assigns vector ip(u) to each vertex u). Since the cost of the optimal solution for 
G[M] - E~~ is 0(OPT/2 t ), the cost of the SDP solution is also 0(OPT/2 t ). The solution defines a metric 
d(u,v) = \\<p{u) — <p{v)\\ 2 on the set M. We analyze the metric at scale St = So/2 t (where So > is an 
absolute constant). For every vertex u, consider the set B u = {v : d(u,v) < St} of vertices at distance at 
most St from u. Let us say that a vertex u is Sflight if \B U \ < S 2 n, and that u is Sfheavy if \B U \ > S\n. 
Denote the set of heavy vertices by H and light vertices by L. Broadly speaking, we first use a procedure to 
remove the heavy vertices H (and further process them to get ^-feasible sets Zi), while cutting only a few 
edges (these cut edges are added to E~). In the remaining graph G[M] — E~ all vertices are light. We show 
that in such a solution, at most an 0(5f ) fraction of edges from Er are shorter than St/2. Here we crucially 
use that Er is a random set of edges (and, thus, the graph G = (V, Er) is "geometrically expanding"). We 
cut all edges in G[M] — E~ longer than St/ 2 and add them to E~ . In the obtained graph G[M] — E~ all 
edges are shorter than St/2, hence it contains at most 0(S 2 OPT) edges from Er. Thus in the next iteration, 
the cost of the optimal solution for the sub-instance on G[M] — E~ is 0(S 2 +l OPT) (as we need). 

The Heavy Vertex Removal procedure finds new sets Zi that cover all heavy vertices H in several rounds. 
In each round, we define a few sets Zi, each Zi contains a subset of heavy vertices together with their r- 
neighborhoods (where r G (St, 2St)). We cut sets Zi away, add edges from Zi to the rest of the graph to E~ 
and then process remaining heavy vertices. We ensure that all sets Zi have a small diameter and this implies 
that sets Zi are ^-feasible. We cut sets Zi so that sets Zi cut in one round are far away from each other and 
the total number of rounds is small. This guarantees that the total number of cut edges by this procedure is 
small (here, we use that each set Zi contains a ball B u for some heavy vertex u, and hence Zi is not very 
small). 

To upper bound the number of edges cut by removing edges longer than St/2, we observe that the SDP 
value in iteration t is 0{S?OPT). The number of these cut edges is 0(S?OPT/S t ) = 0(S t OPT). Thus, 
the number of edges cut in all iterations is 0(X^ SiOPT) = 0(OPT). 

The algorithm for Small Set Expansion (SSE) requires several new ingredients. The main problem is 
that we cannot use the SDP relaxation of Bansal, Feige, Krauthgamer, Makarychev, Nagarajan, Naor and 
Schwartz [8] since it may assign zero vectors to all vertices in S. Instead we use a "Crude SDP" (C-SDP) 
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for the problem. C-SDPs were recently introduced by ||29l . The C-SDP for the Small Set Expansion is not 
a relaxation for the problem; its objective value may be much larger than the value of the optimal integral 
solution (in particular, the value of a C-SDP can be large even if the cost of the optimal solution is 0). To 
solve a semi-random instance of the Small Set Expansion problem, we first apply the HSS step. However, the 
number of edges in E~ is bounded in expectation by the cost of the C-SDP solution and may be much larger 
than the cost of the optimal solution. So our algorithm cannot afford to cut all these edges. Nevertheless, we 
prove that the number of edges in E~ incident to the set S (S is the optimal solution, which is not known 
to the algorithm) is bounded by 0{OPT) (the total number of edges in E~ can be much larger than OPT). 
Then we show how to find a good solution by combining the SDP based SSE algorithm [8 | with a new LP 
algorithm. 

Solution Purification. As mentioned earlier, we show that if we additionally assume that graphs G[S] 
and G[V\S] are combinatorial expanders in the Balanced Cut or Small Set Expansion problem, then we can 
almost recover sets S and G\S (see Theorem l6.2l ). We do that by first finding a good approximate solution 
using our algorithm for Balanced Cut (or Small Set Expansion) and then improving the solution by repeatedly 
solving (semi-random) instances of the Sparsest Cut problem to obtain successively finer approximations to 
the planted partition. 

Organization. In Section [2 we present our semi-random model and give definitions, which we use 
throughout the paper. In Section [3] we state the Hidden Solution Sparsification theorem. We describe our 
approximation algorithms for Balanced Cut, Multi Cut, Small Set Expansion and Sparsest Cut, which rely 
on the HSS theorem, in Sections [3.11 13.21 13.31 and 13.41 Then in Section |4j we prove the HSS theorem for 
graphs with the geometric expansion property. In Section [5] we show that semi-random graph satisfy the 
geometric expansion property and thus conclude the proof of our main result. In Section [6j we show that we 
can almost recover the original partitioning if all parts are combinatorial expanders. Finally, in Section [7] we 
study the planted algebraic expander model. 



2 Preliminaries 

In this section, we define some notation that is convenient for working with partitioning problems. Through- 
out the paper, we let n to be the number of vertices n = \V\ and 7~L = W 1 . We denote by C a fixed constant, 
and this will usually correspond to the approximation ratio given by the algorithm. We do not make an 
attempt to optimize the constants in this version of the paper. 

We use the following notation: the &,-&iame.tev of a set Z C Ti. equals diam(Z) = max{||u — v\\ 2 : 
u, v G Z}; the ^-bafl of radius r around a set Z C 7~L is defined as Ball(Z, r) = {u G 7~L : 3v G 
Z s.t \\u - v\\ 2 < r}. We let Ball(u, r) = Ball({t;}, r). 



2.1 Partitions 

Definition 2.1. Let V be a set of vertices. We say that V is a partition of V into disjoint sets or simply 
partition, ifV = Upe-p P an d ever J two P'j P" G V are disjoint. For every vertex u G V, denote by V(u) 
the unique set P 6P containing u. 

Denote by Is : V — > {0, 1} the indicator function of the set S C V: 



Is(u) 



1, ifueS; 
0, otherwise. 



Definition 2.2. Let G = (V, E) be a graph and V be a partition ofV. Define the set of edges cut by the 
partition as follows 

cut(P, E) = {(«, v) G E : V{u) ^ V(v)}. 
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The cost of the cut equals the size of the set cut('P, E): 

cost("P, E) = \ cnt{V,E)\. 

The cost of the cut restricted to a subset OC1/ only considers those edges which are incident on O: 

cost| (7 3 , E) = | {(u,v) G cut(V, E) : u £ O or v £ O} \ 
= ^2 max{ Jo(it), Io(v)}- 

(u,v)£cut(P,E) 

2.2 Partitioning Problems 

We now define three of the graph partitioning problems that we study in this paper. 

Definition 2.3. (BALANCED Cut) Given a graph G = (V, E), the aim is to find a partition V(Pi,f*i) ofV 
with \Py\ = IP2I = n /2 which minimizes cut(V,E). 

A constant factor approximation algorithm finds a partition V (P[, Pq) with \P1\1\P2\ < /3n /2 for some 
fixed constant 1 < (3 < 2, such that cost{V' , E) < 0(1) cost (V* , E), where V* is an optimal balanced 
cuR 

Definition 2.4. (Small Set Expansion) Given a graph G = (V,E) and a parameter p e (0,1/2], 
the aim is to find a partition V{P\, P2) ofV with \P±\ = pn that minimizes cost("P, E). We will also be 
concerned with constant factor approximations (defined like in Balanced Cut). 

Definition 2.5. (MULTICUT) Given a graph G = (V,E) and a set of terminal pairs {(sj, tj)}i<i<r. the 
aim is to find a partition VofV that separates all terminal pairs Si,t{ (i.e., for all i, V(si) 7^ V(ti)) and 
minimizes cost^T 3 , E). 

2.3 Semi-random Models 

We formally define the first semi-random model. 

Definition 2.6. Consider a set of vertices V and a partition of vertices into disjoint sets V. Let Er = 
{(u,v) : V(u) ^ V(v)} be the set containing all vertex-pairs crossing partition boundaries. Let Ek = 
{(u,v) : V{u) = V(v)} be the set containing all vertex-pairs not crossing partition boundaries. (Thus, 
(V, Ek U Ek) is the complete graph on V.) Consider a random subset of edges Epc of the set Ek: each 
edge (u, v) £ Ek belongs to Er with probability e and these choices are independent. We define a random 
set of graphs SR(V, e) as follows: 

SR(V, e)={G= (V, E) :ECE R U E~k}. 

The optimal cost of the semi-random partition is defined as 

si-cost(V,e) = ^\E R \ = e\E k \. 

'This is sometimes referred to as O(l) pseudo-approximation [6|. 
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2.4 Local SDP Relaxations, Heavy Vertices and ^-Feasible Sets 

Definition 2.7. Let V be a set of vertices. In this paper, we say that a map tp : V —> % is an SDP solution if 
vectors in tp(V) satisfy t%— triangle inequalities: for every u,v,w G V, \\<p(u) — tp(v) || 2 + \\<p(v) — ip(w)\\ 2 > 
\\tp(u) - tp(w)f. 

For instance, solutions to the SDP relaxations in Appendix satisfy the above definition. The SDP 
solution tp defines a metric on the vertices V (given by \\tp(u) — <p(v) \\ 2 for u, v G V). The cost of a solution 
tp corresponds to the total length of edges according to metric given by tp. 

Definition 2.8. Let G = (V, E) be a graph, V be a partition ofV, and O be a subset ofV. Define the cost 
of an SDP solution tp : V — ^ H to be 

sdp-cost(tp,E) = - \\ip(u) - tp(v)\\ 2 , 

(u,v)eE 

and the cost of the SDP solution restricted to the set O to be 

sdp-costioO,^) = - \\<f( u ) ~ V^W 2 ■ 

(ti,u)eE 

ugO or veO 

For any SDP relaxation, the cost of the optimum (minimum) SDP solution lower bounds the value of the 
best integral solution. However, this lower bound may not hold when restricted to a subset O C V. This 
motivates the following definition: 

Definition 2.9. Let V be a set of vertices, V be a partition of V and O Q V. We say that a non-empty set 
of SDP solutions <3? is a O- local relaxation ofV if there exists a constant C > 1 such that for every graph 
G = {V, E) on V and for 

tp = argmin sdp-cost(</?, E) = argmin - ||v 9 (' u ) — V 9 ( v )l| 2 ) 

(u,v)GE 

the following inequality holds 

sdp-costi£)(</?, E) < C cost\o(V, E). 

Note that an SDP relaxation of a problem is always a y-local SDP relaxation of the optimal integral 
solution. 

Definition 2.10. Let V be a set of vertices and $ C {tp : V — > %} be a set of SDP solutions. We say that a 
subset S C V is (^-feasible if there exists tp* G such that for every u,v G S, 

\\tp*( u )-tp*(v)f <i. 

^-feasibility captures sets that require no further processing, to belong to a solution. For example, <3>- 
feasible sets correspond to small enough sets for the Balanced Cut or Small Set Expansion problems, and to 
sets which do not contain any terminal pairs for the Multicut problem. 

An SDP solution tp classifies the vertices into two types (heavy or light) depending on the number of 
vertices in their ^-neighborhoods. 

Definition 2.11. Let V be a set ofn vertices, and MOV. Consider an SDP solution tp : V — > %. We say 
that a vertex u G M is 5— heavy in M if the G^-ball of radius 5 around <p(v) contains at least 5 2 n vectors from 
tp(M) i.e., \{v G M : tp(v) G Ball(<^(it), 5)}\ > S 2 n. We denote the set of all heavy vertices by Hg^M). 
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The following property of semi-random instances is crucially used in our algorithms. 

Definition 2.12. (Geometric Expansion) A graph G = (V, E) satisfies the geometric expansion prop- 
erty with cut value X at scale 5 if for every SDP solution ip : V — > % and every subset of vertices M C V 
satisfying Hg ttp (M) = 0, 

\{{u,v) e E H (M X M) : \\(p(u) -y{v)\\ 2 <5/2}\ < 25 2 X. 

A graph G' = (V, E') satisfies the geometric expansion property with cut value X up to scale 2~ T (T G N) 
if it satisfies the geometric expansion property for every 5 G {2 _t : 1 < t < T}. 

We can slightly simplify the deflnitiorH above by requiring that ip satisfies the condition Hg^V) = 
and M = V. See Section [5]for details. 

In section [51 we will see that in semi-random instances SR(V,e), the graph consisting of the random 
edges (V, Er) is geometrically expanding w.h.p. for sufficiently large e. 

3 Hidden Solution Sparsification and Applications 

In this section, we state the main technical result of the paper, and then show how it can be used to obtain 
constant factor approximation algorithms for the Balanced Cut, Multicut and Small Set Expansion problems 
in the semi-random model. 

Theorem 3.1. (Hidden SOLUTION SPARSIFICATION) There exists a polynomial-time randomized algo- 
rithm that given a graph G = (V, E), a separation oracle for an O— local SDP relaxation <I> of a partition V 
(note: the set O C V and partition V are "hidden" and are not known to the algorithm), and a parameter 
D = 2 T (T G N, T > 1), partitions the set of vertices V into a set M and a collection of disjoint sets Z 

V = MU (J Z, 

zez 

and also partitions the set of edges into two disjoint sets E + and E~ 

E = E + U E~ 

such that 

• all edges cut by the partition V = ML) % He in E~ ( i.e., cut({M} UZ,E) C E~ ), or in other 
words, 

E + C M x Ml) [J Z x Z; 

zez 

• if the graph (V, cut(V, E)) satisfies the geometric expansion property with cut value X up to scale 
1/yD, then (the expectation is taken over random bits of the algorithm) 

E[cost| 0nM (7>, E+)} <CX/D; (1) 

and 

| {(«,«) G E~ -.ueOorv G0}|<C X; (2) 

• each Z G Z is ^-feasible. 

We first show how to construct constant factor approximation algorithms for Balanced Cut, Multicut and 
Small Set Expansion using the theorem. We prover the theorem in Section @] 

2 We note that every Ramanujan expander is geometrically expanding with some parameters. However, we omit the details here. 
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3.1 Balanced Cut 



We show that there exists a constant factor bi-criteria approximation algorithm for the Balanced Cut problem 
in the semi-random model with e > f)(yTogre(log log n) 2 /n). 

Theorem 3.2. There exists a randomized polynomial-time algorithm, a function f : N — > [0, 1] tending to 

as n — > oo, and absolute constants C, Crc> such that for every set of vertices V of size n (for simplicity 
assume n is even), every partition V = {L,R}, \L\ = \R\ = n/2, and every e G (0,1) with probability 

1 — f(n) = 1 — o(l) over random choice ofSR^, e) the following statement holds: for every G = (V, E) G 
S'i?('P,e) the algorithm returns a balanced partition ofV into sets V and R' with \L'\, \R'\ > n/C and 
expected cost of the cut at most: 

E[cost({L',i? / },£;) | SR(V,e)] < C BC max{sr-cost(P, e), ra v/log re(log log n) 2 }. 

Particularly, if e > \/\og n(log log n) 2 /n, then 

K[cost({L',R'},E) | SR(V,e)] < C B csr-cost(V,e) = C BC e—. 

We use the standard SDP relaxation for the Balanced Cut problem. The SDP has a unit vector u for every 
vertex u G V. All vectors satisfy l\ triangle inequalities: for all u,v,w G V: 

\\u — v\\ 2 + \\v — w\\ 2 < 1 1 -tZ — -iZ> 1 1 2 - 

Finally, all vectors satisfy the spreading constraint (below we count every pair as (u, v) and (v , u))\ 

ii n- 2 

u,v£V 

The objective function of the SDP equals 1/2 sdp-cost(u h-» u,E). (For clarity, we give the SDP in 
Appendix O) 

The SDP relaxation defines a set of feasible solutions <£. This set is a y-local SDP relaxation for V, since 
every SDP relaxation is always a F-local SDP relaxation. Indeed, sdp-cost|y (ip, E) = sdp-cost(</?, E) for 
every ip (just by definition), and particularly, for <p* = argmin^g^ sdp-cost(</?, E), 

sdp-costiy(y?*, E) = sdp-cost((/5*, E) = minsdp-cost((/?, E) 

1 </3G<I> 

< 2cost(V,E) = 2cost\ v (V,E). 

(The factor of 2 appears because of a different normalization of the objective function.) 

In the algorithm below, we use the ARV algorithm for finding a balanced cut (in the worst-case) of Arora, 
Rao, and Vazirani [6]. We denote the approximation factor of the algorithm by Darv = 0{y/\og n). For 
simplicity of exposition we assume that Darv is a power of 4. 



Balanced Cut Algorithm in Semi-random Model 

Input: a graph G = (V, E) G SR{V, e) 
Output: a cut (L',R'), with \L'\, \R'\ > n/C 

• Run the Hidden Solution Sparsification Algorithm with a separation oracle for $ and obtain a set 
M C V, a partition Z of V \ M in disjoint ^-feasible sets and two disjoint sets of edges E + and E~ 
(with parameter D = Darv)- 
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• Run the ARV algorithm on the graph G = (V, E + ), and obtain a balanced partition (L', R'); 

• return (L', R'). 



Analysis. We show that every set Z in Z is balanced. Every set Z G Z is <I>-feasible, that is, for some 
<p(Z) has I 2 diameter at most 1/4. Thus, 

1 £ ||^)-^)f < 



< 

By the SDP spreading constraint, the left hand side is greater than or equal to n 2 /2, thus \Z\ < n < 

4/5 n. 

By the Structural Theorem 15.11 with probability 1 — f(n) = 1 — o(l) for every graph G = (V,E) G 
SR(V, e), the graph (V, cut(V, E)) is geometrically expanding with cut cost 

X = C maxjsr-cost^, e), ny / logn(log log n) 2 } 

up to scale 1 / \JTJ~arv- Thus, by Theorem 13. II 

cost , M <CI/fl M . 

Hence, there are at most C X/Darv edges in E + going from L D M to R fl M. Observe, that |L n M| < 
|L| = n/2 and |i?n M\ < = n/2. Therefore, there are at most C X/Darv edges in E + cut by the the 
partition 

V = (M fl L) U (M n i?) U J 2 

zez 

(the only edges cut are the edges between M n L and MnR) and each of the sets in the partition has size 
at most 4/5 n. These sets can be grouped into two balanced sets L* and R* with \L*\, \R*\ > 1/5 n. The 
ARV algorithm finds a possibly different balanced cut (L', R!) (with slightly weaker bounds on \L'\, \R'\). 
The number of edges cut in E + is bounded (in expectation) by Darv x C X/Darv = C X. The number 
of edges cut in E~ is bounded by \E~\ < C X. 

3.2 Min Multicut and Min Uncut 

The algorithm for the Multicut problem is similar to the algorithm for Balanced Cut. We use the standard 
SDP relaxation for Multicut: the SDP has a unit vector u for every vertex u; vectors Si, ti corresponding to 
source-sink pairs Si, ti are orthogonal ((sj, Vi) = 0); all vectors satisfy the i\ triangle inequality constraints 
(please see Appendix O for details). The key observation is that every ^-feasible set Z G Z has a small 
diameter w.r.t. some SDP solution and thus may not contain a source-sink pair Sj, ti. Finally, to find a 
solution in M we use the algorithm of Garg, Vazirani, and Yannakakis [21]. 

We get a constant factor approximation algorithm for the Multicut problem in the semi-random model 
withe > logn(loglogn) 2 /ra. 



- V max(||<p(u) - <p(v)\\ 2 ) - 
z * — ' u,»ei/ 

u,v£V 

~\ E (maxdl^n)-^)!! 2 )- max(||^)-^)|| 2 )) 

u,v£Z 

n 2 --\Z\ 2 . 
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Theorem 3.3. There exists a randomized polynomial-time algorithm, a function f : N — > [0, 1] tending to 
as n — > oo, and an absolute constant C, such that for every set of vertices V of size n, every partition V, and 
every e G (0, 1) with probability l—f(n) = 1— o(l) over random choice ofSR(V, e) the following statement 
holds: for every G = (V, E) G SR(V, e) and every set of demands (sj, U) (satisfying V(si) ^ Vfa)), the 
algorithm returns a partition V' ofV separating the demands (V'(si) ^ V(ti)) with expected cost of the 
cut at most: 

K[cost(V',E) | SR(V,e)] < Cniax{sr-costCP, e),nlogra(loglogn) 2 }. 

A similar statement holds for the Min Uncut problem if e > \/log n(log log n) 2 /n. A semi-random 
instance of Min Uncut is generated as follows: the adversary first chooses an arbitrary subset S of vertices, 
then the nature connects each pair of vertices (u,v) G S x SU(V\S) x (V\S) with an edge with probability 
e, finally the adversary adds arbitrary edges between S and V \ S, and removes some random edges. The 
problem can be restated as a cut minimization problem that falls in our framework (see e.g. |]T|). Our 
algorithm for Min Uncut first runs the Hidden Solution Sparsification algorithm and then uses the algorithm 
of Agarwal, Charikar, Makarychev, and Makarychev [1] for Min Uncut. We defer the details to the journal 
version of the paper. 

3.3 Small Set Expansion 

Our algorithm for the Small Set Expansion (SSE) problem is the most involved, and uses the full power of 
the Hidden Solution Sparsification theorem. 

Theorem 3.4. There exists a randomized polynomial-time algorithm, a function f : N — > [0, 1] tending to 
as n — > oo, and an absolute constant C, such that for every set of vertices V of size n, every partition 
V = {S, V \ S}, \S\ = pn (for p G (0, 1/2)) and every e G (0, 1) with probability 1 - f(n) = 1 - o(l) 
over random choice of SR(V,e) the following statement holds: for every G = (V,E) G SR(V,e), the 
algorithm given G and p, returns a partition V' = (5", V \ S') ofV such that \S'\ = @(pn), \S'\ < \V\/2 
with expected cost of the cut at most: 

E[costCP',£) | SR{V,e)) < C'max{sr-cost(7 3 J e), n^/log n log(l//o) (log log n) 2 }. 

Particularly, if ep > y^log n log(l/ p) (log log n) 2 /n, then 

E[cost({L, R},E) \SR(P,e)] < Csr-cost(P, e) = Cep(l - p)n 2 . 

Moreover, instead of requiring that G = (V,E) G SR(V,e), it suffices that the graph (V, cut("P, E)) is 
geometrically expanding with cut cost 

X = C / max{sr-cost('P, e), ny^og n log(l/ p) (log log n) 2 } 

(for some absolute constant C') up to scale s(n, p) = S7(y/k>gnlog I). 

By Theorem 15.11 for every graph G G SR(V,e), the graph (V, cut(7 3 , E)) is geometrically expanding 
with cut cost 

X = C' max{sr-cost("P, e), n \/\og n log(l/ p) (log log n) 2 } 

up to scale s(n, p) = 0(ylog~nlog ^) with probability 1 — o(l) over random choice of SR(V, e). We as- 
sume that the graph (V, cut(V, E)) is geometrically expanding. Otherwise, the algorithm fails (this happens 
with probability o(l)). 
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We use an analog of the Crude SDP (C-SDP) introduced in the paper of Kolla, Makarychev and Makarychev 
|29l . For each vertex u G V the C-SDP has a unit vector u G %. All vectors satisfy triangle inequality 
constraints and spreading constraints (similar to constraints introduced in Bansal et al. [8]; note that their 
SDP would not work in our case: loosely speaking, it may "find" a good fractional cut that assigns zero 
vectors to the real solution): for every u G V, 

^2(u,v) < pn. 

vev 

We give the C-SDP in its entirety in Section O Note that this SDP is not a relaxation for SSE. However, it 
turns out that this is a S'-local SDP relaxation of partition (S, V \ S) (see Lemma |A~D) . We now use the 
Hidden Solution Sparsification algorithm to find the set M and a partition of V \ M into ^-feasible sets 
Z G Z. Here <I> is the set of feasible C-SDP solutions. We set the weight of every vertex u G M to be the 
number of edges in E~ incident on u: w u = \{v : (u,v) G E~}\. w u corresponds to the cost we would 
pay for cutting the w u edges incident on u from E~ , if u were included in the solution (small set). Observe, 
that the weight of the "hidden" set S is at most C\X (for some absolute constant C\, see (f2])). Then, we 
consider two cases: \M n S\ > \S\/2 and \ (V\M)DS\ > \S\/2, depending on whether most of the hidden 
set S vertices belong to M or the pieces Z G Z of the partition (the algorithm does not know which of the 
inequalities holds and tries both options). 

Case I: This case is handled similar to the proof in Sections I3TT1 Since most of S (the hidden solution) 
belongs to M, we know that there is a good solution M n S in G(V, E + ) i.e. S n M has size G [pn/2, pn] 
with weight w(M n S) < C\X, and there are at most CX/Dsse from E + going out of S n M. Now, we 
use the following theorem of Bansal et al. J8 ] which finds small non-expanding sets. 

Theorem 3.5. (Special case of Theorem 2.1 OD, arXiv version) There exists a polynomial-time 
algorithm ("SSE algorithm" ) that given as input a graph G = (V, E), a set of positive weights w u (u G V), 
p G (0, 1/2] and W G M + , finds a non-empty set S C V satisfying \S\ G [Q(pn),3pn/2], and w(S) = 
Y^ugS w u — CW, such that 

E(S, V\S)< Dsse ■ min{E(S, V \ S) : \S\ = pn, w(S) < W}, 
where D S se = O(v / !ognlog(l/ / o)). 

Remark: This theorem is stated in a slightly different form in Bansal et al. [8]. We discuss the differences 
in Appendix (Section [B]). 

We use this SSE algorithm on G(V, E + ) to find a set S' with |5'| G [n(pn),3pn/2], w(S') = J2ueS' w ^ ^ 
C CiX, and 

co S t({S', V \ S'}, E+) < Dsse x CX/D SS e < CX. 

The total cost of the cut E(S', V \ S') is bounded by the number of edges cut in E + and E~ which is at 
most CX and CC\X respectively, which is 0(X) as needed. 

If p G (1/3, 1/2), the set S' may contain more than re/2 vertices, but no more than 3pn/2 < 3/4n. Then 
the algorithm returns S" = V \ S' satisfying \S"\ G [re/4, re/2]. 

Case II: In this case, the hidden solution S could mostly be spread arbitrarily among the pieces Z G Z 
in V \ M. Here, our algorithm uses an LP to extract the solution from the set V \ M. The key observation 
is that this set is already partitioned into pieces of small size. Indeed, every Z G Z is ^-feasible, and thus 
for some (p G <3?, diam((/?(Z)) < 1/4 and, consequently, for every u,v G Z, ((p(u),ip(v)) = (\\(p(u)\\ 2 + 
ll^^)!! 2 — II^W — ( / ? (' u )ll 2 )/2 > 7/8. Using the C-SDP spreading constraint (for an arbitrary u G Z), 

J2(ip(u),(p(v)) < pn, 
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we get \Z\ < 8/7 pn. 

The LP has a variable x v G [0, 1] for every vertex u G V\M ; and the only constraint is that J2 u gv\m x u — 
pn/2). The objective function is 



mm 



^ ^ w u %u ~i~ ^ ^ l^u (3) 

uey\M («,u)e£+ 
u,v£V\M 

The canonical solution to this LP is as follows: x u = 1, if it G S 1 n (V \ M); x u = 0, otherwise. The 
LP cost of this solution is at most CX, because the first term in the objective function is bounded by C\ X 
(see (O), the second term is bounded by the size of the cut(V, E), which is at most C2 X (The expected 
size of the cut equals sr-cost("P, e); by the Chernoff bound the size of the cut is less than 2 sr-cost(7 ? , s) 
with very high probability). Thus, the cost of the optimal solution {x* }, which we denote by LP*, is at most 
C X = (C\ + C2)X. For an integral solution S' C V, we define the cost 



f(S')= J> u + \E+(S',V\S')}\ (4) 

ueS' 

= J2 Wu + l{( n ' u ) G E + : u G S',v $ S'}\. 
ueS' 

For every r G [0, 1] define S r = {u : x* u > r}. The algorithm finds r* that minimizes the ratio 
f(S r )/\S r \ subject to \S r \ > pn/A (note: \S\\ = \V\M\ > pn/2). Then it sorts all sets Z G Z in order 
of increasing ratio f(S r * n Z)/\S r * n Z\ (ignoring empty sets) and gets a list Z\, . . . Zk- It picks the first k 
pieces such that 

\Zi n s r * \ + \z 2 nS r *\-\ h \z k n SV* | G [pra/4, 2/m], 

and returns 

k 

S' = [JZiDSr*. 

i=l 

Note, that such k exists because each piece Zi n 5 r * has size at most 8/7 (as |Zj| < 8/7 pn) and 
X)iLi \ z i ^S r *\ = \S r *\ > pn/4. 

Analysis of Case II. We first prove that 

f(S r *) < 4LP*/(pn) .\Sr*\. 

Observe that 



f f{S r )dr = LP* [ \S r \dr > 

Jo Jo 



pn 



The first equality easily follows from © and d4), the second equality follows from the LP constraint. Let 

R = {r : \S r \ > pn/A}. Then 

S r \dr > ^- / |5 r |dr > 



R 2 J[0,1]\R ' 4 

and, since r* = mm{f(S r )/\S r \ : r G R}, 

f( S r*) | u. s fjSr*) f>n 

\S r A ' 4 ' 



LP* = [ f(S r )dr > [ \Sr\dr > 

JR JR \&r*\ 
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Thus, f(S r *) < ALP* I {pn) ■ \S r *\. Using that edges in E + do not cross the boundaries of sets Zj, we get 

K K 
AT P* 

Recall, that {/(Zj n SV*)/|S r * n Zj|}j is an increasing sequence, thus 

/(£') = /((J ^ n S r *) = ^ /(Zi n S r *) 

i=l i=l 
4LP* A,„ , 4LP* 



< Yj \ Sr * nZ i\ = \ S '\ ^ WLF*. 



pn pn 



3.4 Sparsest Cut 

We now show how to find an approximate sparsest cut in a semi-random graph G using the algorithm for 
Small Set Expansion. Specifically, we give an algorithm that for every subset U C V intersecting each of 
the pieces of the planted partition (S, T) (see below for details), returns a cut {A, U \ A) of sparsity 

E(A f A } A) < om. 

\A\ - v y 

In Section [6l we show that the Sparsest Cut algorithm can be used to recover pieces S and T assuming that 
the graphs G[S] and G[T] have large expansion. We remark that while we are usually concerned with the 
case when U = V for the sparsest cut problem, the following stronger statement is also useful for Section [6] 

Theorem 3.6. There exists a randomized polynomial-time algorithm, a function f : N — >■ [0, 1] tending to 

as n — > oo, and an absolute constant C, such that for every set of vertices V of size n, every partition 
V = {S, V \ S} and every e,n € (0, 1) satisfying en > \/log n(log log n) 2 /n with probability 1 — f(n) = 

1 — o(l) over random choice of SRiV, e) the following statement holds: 

for every G = (V,E) £ SR(V,e), every U C V such that \U t~) S\ > nn and \U n T\ > nn, the 
algorithm given G, returns a partition (A,U \ A) of G[U] with \A\ < \U\/2 such that with probability 
exponentially close to 1, 

\E(A,U\A)\ 

j-^j < Cscen. (5) 

Proof Sketch. We first give a proof assuming en > yjlog n log (1/7/) (log log n) 2 /n. 

Our algorithm guesses the size of | S Pi U\ , computes the size of|Tnf7| = \U\ — \SC\U\. Then, it runs the 
Small Set Expansion algorithm on G[U] with p = min(|5n?7|, \TnU\)/\U\, obtains a set A (\A\ < \U\A\) 
of size @(p\U\) and returns the cut (A, U \ A). We need to show that the size of the cut (A, U\ A) is at most 
0(ep\U\n), so that the sparsity of the cut is then 0(en). 

Let us explain why we can use the Small Set Expansion algorithm for the graph G[U] and why the 
algorithm finds a cut of cost at most 0(ep\U\n). By the structural theorem (Theorem 15. II part II), with 
probability 1 - o(l), for every U C V, the graph (U,E D (U X U) D (S X T)) (i.e., the bipartite graph 
between pieces U H S and £7" fl T) is geometrically expanding up to scale ^/\ogn\og{l/n) with cut value 

X = Cmax{sr-cost(7 :, |;7, e), ny/\og n log(l/r/) (log log n) 2 } 
= Cmax{e|5n f/| • \T D U\, n^\og n log(l/r/) (log log n) 2 }. 
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Below, we assume that the graph (U, E D {U x U) H (S x T)) is geometrically expanding; otherwise our 
algorithm fails (which happens with probability o(l) over the choice of SR(V, e)). Write the lower bound 
on er\ and a trivial inequality one|5nf/|-|TnC/|: 

ylog n log (I/77) (log log n) 2 < erjn < emin(|5 n U\, \T Pi U\)n; 

e\SC\U\ ■ \THU\ < emin(|snc/|,|rnc/|)n. 

Together these inequalities give us an upper bound on X: 

X < Cemm(\Sr\U\,\TnU\)n < Cep\U\n. 

By Theorem [331 the Small Set Expansion algorithm returns a cut of size O(X) (Here we use that the graph 
(U, E n (U x U) n (S x T)) is geometrically expanding up to scale ylog n log(l/r/) > \/lognlog(l/p)). 

We showed that the algorithm finds a cut of sparsity a = 0(en) in expectation. By Markov's inequality, 
it finds a cut of sparsity at most 2a with probability at least 1/2. So by repeating the algorithm many times 
and then picking the best solution, we can get a solution of cost at most 2a with probability exponentially 
close to 1 . 

Finally, let us briefly explain how to get rid of the y/log(l/rj) factor in the lower bound on erj. Observe 
that it suffices for our algorithm to find a set A of size \A\ G [0(p|f7|), |J7|/2] i.e., we do not need a bound 
\A\ < 0{p\U\). So we slightly modify the Small Set Expansion algorithm so that it works for smaller erj, 
but possibly returns \A\ S> p\U\. In Case I of the algorithm (see Theorem I3.4K we use Theorem 2.1 (part 
I) instead of Theorem 2.1 (part II) of Bansal et al. [8] with p = 1/2. This algorithm returns a sparse cut of 
size at most p\U\ = \U\/2 of sparsity 0{y/\og n)OPT (where OPT is the optimal sparsity of the cut). We 
repeatedly apply this algorithm and obtain disjoint sets A\, . . . , At- After we get a set Ai, we remove it from 
U. We stop when | U At\ > p\U\/4. We let A = L)A t . It is not hard to show that the sparsity of A is at most 
0{y/\og~R)OPT (where OPT is the value of the sparsest cut in {U,E + )) and | A\ G \Q(p\U\), \/2\U\\. The 
proof is similar to the proof of Theorem 2.1 (part II) in [8 ]. We omit it in this version of the paper. □ 

4 Hidden Solution Sparsification Algorithm 

We now present the Hidden Solution Sparsification Algorithm and prove Theorem 13.11 The algorithm runs 
in O (log log n) phases. In each round, we first solve the SDP on the current instance. The heavy vertices 
w.r.t. to this vector solution are first processed and removed using the algorithm from Section 14.11 In the 
remaining graph, we remove (cut) long edges to further "sparsify" the hidden solution {Er) and produce the 
instance for the next phase. 



Hidden Solution Sparsification Algorithm 

Input: a graph G = (V, E) and a separation oracle for a set of SDP solutions $ C {V — > %}. 
Output: partitions V = ML) \J Z ez % and E = E + U E~ . 

• Let M = V, Z = 0, E^ = E, E Q = 0, T = \ log 2 D, and 6 t = 2~ f for all t = 1, . . . , T. 

• for t = 1, . . . ,Tdo 

A. Solve the SDP for the remaining graph: Find 

(p t = axgminsdp-cost(<p, E^_ 1 n (M t _i x M t _i)). 
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B. Remove 5 t -heavy vertices: run Heavy Vertices Removal Algorithm (described in Section 14.11 ) 
with parameters V, M t -i, ipt, and obtain a collection of ^-feasible sets AZ t . Add edges in 
E^_ x cut by AZ t to the set AE^ . Let 

Z t = Z t ^UAZ t ; M t = M t „ 1 \ [j Z. 

ZeAZt 

C. Remove dt—long edges from E + : Find 

Lt = {(u,v) £E + :u,v£ M t , \\tpt(u) - <p t (v)\\ 2 > S t }. 

Let 

E+ = EU \ (AEi U L t )- E; = E-_ x U (AEj~ U L t ). 
return M = M T , Z = Z T , E + = E£, E~ = 



Proof of Theorem 1X71 We analyze the algorithm given above. We note that the step A of finding tpk can be 
performed in polynomial-time using semidefmite programming; the step B is performed using the algorithm 
described in the next subsection. 

At every iteration, the algorithm removes all edges crossing the partition AZ t from E^~ and adds them 
to Ef, hence the first item of Theorem B. 1 I holds. The third item holds, because every set Z G Z belongs to 
some AZ t and, thus by Lemma PTT1 (see below), diam(<^(Z)) < 1/4. 

We now show that the second item of Theorem 13.11 holds . We first prove that 

cost| Mt (P,£+)<2X-5 t 2 

for every t G {0, . . . , T}. The Heavy Vertices Removal Procedure returns set M t that does not contain any 
5t-heavy vertices w.r.t. (ft i.e., Hs t ^ t (M t ) = (see Lemma |4~TI ). Using the geometric expansion property 
ofthegraph(y,cut(S,P)),weget'|{(ii,t;) G cut(V, E)n{M t xM t ) : \\ipt(u)-ip t (v)\\ 2 < 5 t /2}\ < 25 2 X. 
The algorithm removes all ^t/2-long edges at step C, thus the set D {Mt X M t ) contains only edges (it, v) 
for which \\ip t (u) — cpt(v)\\ 2 < St/2. Combining this observation with the previous inequality, and using that 
edges in Ef do not cross the boundary of M t , we get 

cost| A/t (-p, Ef) = \cut(E, V) n Ef n {Mt x M t )\ < 25 2 t X. (6) 

For t = T, we get cost| M (P, E + ) < 2 X/D. 

Finally, we estimate the size of the set {(u, v) G E~ : u G O or v G O}. To do so, we use that $ is a 
O-local relaxation of the partition V. For graph G = (V, Ef_^ n (Mt_i x M t -i)), we obtain inequality 

sdp-costpfat, E+_ j n (M t -i x M t -i)) < Ci cost| (7>, £ t ti n ( M t-i x 

= d costionMt.! {V, E+ J < 2CX • <5 t 2 _i 

= 8Ci<5 t 2 X 

The second line of the inequality follows from ©. 
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Now, we bound the number of edges removed from Ef_ x C\0 and added to E t PiO in terms of "sdp-cost". 
At step t, we add two sets of edges to E~: AE^ and L t . Since all edges (u, v) in L t are 5t/2-long (i.e., 

\\<p(u)-<p(v)\\*>St/2), 

, „ u \\<p(u) - <f(v)\\ 2 \L t n(0 x V)\ -S t /2 

(«,^)e-B+_ 1 n(Af t _ixM t _i) 

(u,d)60xV 

Hence, | Lt n (O x V) | < 32Ci 5tX. The probability that the Heavy Vertices Removal Procedure separates two 
vertices u and v connected with an edges in E 1 ^_ l is at most C2{5^ 1 + 5^ 2 W\Mt-i\M t \ / n) ■ \\ ip(u) — f{v) || 2 
(see Lemma |4~TT) . Thus, the expected total number of edges in the set AE^ n (O x V) is at most 

/ , pE|M t _i\M t K . , N / E|Aft_i\Af t K 

C 2 (6t 1 + St 2 1 — x f| j •sdp-cost| (^, J B+ 1 n(Mt_i xMt-i)) < 8C f iC , 2 fo + 1 X *' JX 

The total number of edges in E~ n (0 x V) is bounded by 

Y (32Ci<5 t + 8CiC 2 Jt + 8CiC 2 • E l M *-i \ M *l ) X < + + 8ClC < 2 ) X 

t=i 

□ 



4.1 Heavy Vertices Removal Procedure 

In this section, we describe the algorithm which deals with the heavy vertices in a vector solution. Note that 
in the intended vector solution for all the above problems, all vertices are heavy (the intended solution for 
Balanced Cut has n/2 vectors at a fixed unit vector vq and the rest n/2 of them at —vq). This algorithm also 
shows how we can take advantage of vector solutions which look like the intended solution (say, roughly 
low-dimensional solutions), with many heavy vertices. 

Lemma 4.1. There exists a polynomial-time algorithm that given a set of vertices V, an SDP solution 
(p : V — > T~L, a subset M C V, finds a set of vertices M' C M and a partition of M \ M' into disjoint sets 
Z G AZ such that 

• the set M' does not contain any S— heavy vertices (iLj „(M') = 0)w.r.t. ip. 

• diam((^(Z)) < 1/4 for every Z G AZ; 

• for every two vertices u* and v*, the probability that u* and v* are separated by the partition is 
bounded as follows: 

Pi(3Z G AZ s.t. I z (u*) ± l z (v*)) < CfV 1 + <T 2 E[|M ^ M/|] ) \\<p(y*) - ^(v*)\\ 2 - 

We remark that some of the heavy vertices H$m{M) may belong to M', but they are not heavy anymore 
(w.r.t M'). 

Proof. We use the following algorithm. If 5 > 1/32, we run the algorithm with 5' = 1/32. 



Heavy Vertices Removal Procedure 

Input: a set of vertices V, a subset M C V , an SDP solution tp : V — > "H, a parameter 5 G (0, 1/32]; 
Output: a set M C V, partition V \ M = \J ZeAZ Z; 
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• while (H SjV (M) ^ 0) 

- Connect heavy vertices in M at l\ distance at most 45 with an edge and denote the new set of 
edges by A = {(u,v) G H S , V (M) x H S „{M) : \\<p{v) - ^{v)\\ 2 < 45}. 

- Break graph (H$ tip (M), A) into connected components. 

- Pick a random r G [5, 25). 

- Remove components of small diameter: For each connected component U with diam(</?(?7)) < 
1/8, let 

B v = {v G M : 3u G U s.t. \\ip(u) - <p(v)\\ 2 < r}. 
Denote the set of all connected components of diameter at most 1/8 by U. 

- Remove a maximal independent set: In the remaining set Hs lV (M) \ (Jt/eW U ^ n< ^ a maximal 
independent sell S. For each u G S, let B u = {v : tp(v) G Ball(u, r)}. 

- Remove sets Bjj and B u from M: 

M = M\ ( |J S^U |J B tt ); 

• return M' = M. 

Analysis. It is clear that the algorithm always terminates in polynomial-time (since at every step at least 
one vertex is removed). When the algorithm terminates Hg i(p (M) = by the condition of the "while" loop. 
Every set tp(B u ) removed from M and added to AZ at one of the iterations is contained in a ball of radius 
at most 25; every set <p(Bu) is contained in the 25-neighborhood of a set <p(U) (for some U G U) whose 
diameter is at most 1/8. Thus, the diameter of each ip(B u ) and <£>{Bjj) is at most 1/8 + 45 < 1/4. 

Verify the third item of Lemma |4~D Fix two vertices u* and v*; and consider one iteration of the 
algorithm. We may assume that the algorithm first picks the independent set S and a collection of connected 
components U, and only then chooses random r G [5, 25). Observe, that the distance between (images of) 
any two vertices in S is at least 45 (because S is an independent set), the distance between every two sets in 
U is at least 45 (because every U G U is a connected component), and the distance between every U G U 
and u G S is at least 45 (again because U is a connected component, and u £ U). Thus, <p{u*) may belong 
to at most one Ball([/, 25) or Ball(it, 25). If (p(u*) G Ball(n, 25), then 

Pr(^(u*) G Ball(u,r), ip(v*) (£ Bailor)) < <5 -1 ||^(it) - <p(v)\\ 2 . 

Of course, if ip(u*) £ Ball(u,25), then Pr(^(u*) G Ball(u,r), tp(v*) <£ Ball(u,r)) < Pi(ip(u*) G 
Ball(n,r)) = 0. 

The same statements hold if we replace u G S with U G U. Thus, at one iteration, the probability that 
u* belongs to a removed ball but v* does not belong to the same ball is at most 5~ x \\tp(u) — <p{v) || 2 . Denote 
by T the number of iterations of the algorithm. Then, the probability that u* and v* are separated at one of 
the iterations is at most 25~ 1 E [T] \\ ip(u) - p(v)\\ 2 . 

We now prove that at every iteration but possibly the last, the algorithm removes at least 5n vertices 
from M. Thus, E[T] < 1 + E\M' \ M\/(5n), and the third item of Lemma gj] follows. Observe, that if 
the independent set S = 0, then the algorithm terminates. If S ^ 0, there exists at least one connected 
component L with diam(yj(L)) > 1/8. The maximal independent set in L must contain at least ^(c) -1 ) 
vertices, since for every edge (u, v) G A, \\ip(u) — cp(v)\\ 2 < 45. Thus, |5| > S~2((5 — 1 ). Since each u G S is 
(5-heavy and r > 5, \B U \ > 5 2 n. Hence (using the fact that sets B u are disjoint), 

| |J B u \ = \B U \ > Sn. 

ues ues 

□ 

3 This is done independently of the random variable r, e.g., using a deterministic greedy algorithm. 
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5 Structural Theorem 



We now prove that semi-random graphs are geometrically expanding, namely we prove that with high prob- 
ability for every semi-random graph G = (V,E) G SR(V,e) the graph (V, cut(V, E)) is geometrically 
expanding. 

Theorem 5.1. /. There exists a function f : N — > [0, 1] satisfying linin^oo f(n) = such that for every 
set of vertices V of size n, every partition V, and every e G (0, 1), D = 2 T (T G N, T > 1) with 
probability 1 — f(n) = 1 — o(l) the random set SR^^e) satisfies the following property: for every graph 
G = (V, E) G SR(V, e), the graph (V, cut(V, E)) is geometrically expanding with cut cost 

X = Cmax{sr-cost(7V), nD (log 2 D)} 

up to scale 1 j\[D. 

II. Moreover, a slightly stronger statement holds. For every set of vertices V of size n, every partition 
V, and every e G (0, 1), D = 2 T (T G N, T > 1) with probability 1 — f(n) = 1 — o(l) the random set 
SR(V, e) satisfies the following property: for every graph G = (V, E) G SR(V, e) and every U C V, the 
graph (U, cut('P, E) (U x U)) is geometrically expanding with cut cost 

X = Cmax{sr-cost(:P| v ,e), nD (log 2 D)} 

up to scale \j\f~D. Here Vm = {P H U : P G V} denotes the restriction of the partition V to the subset U. 

We defined Geometric Expansion in Section |2] We now give a slightly different definition of Geometric 
Expansion which is equivalent to Definition ^. 12[ but is more convenient for proving Theorem 15.11 

Definition 5.2. (GEOMETRIC EXPANSION; SEE DEFINITION 12.121) A graph G = (V, E) satisfies the ge- 
ometric expansion property with cut value X at scale 5 if for every SDP solution cp : V — > H satisfying 
H 5>v {V) = 0, 

\{{u,v) G E : \\<p{u) - v(v)\\ 2 < S/2}\ < 25 2 X. 

A graph G = (V, E) satisfies the geometric expansion property with cut value X up to scale 2~ T (T G N) if 
it satisfies the geometric expansion property for every 5 G {2 _< : 1 < t < T}. 

Claim 5.3. Definitions \2.12\ and \5.2\ are equivalent. 

Proof Sketch. It is easy to see that every graph satisfying Definition 12. 12l satisfies Definition l5.2t we simply 
let M = V. Assume that G = (V, E) satisfies Definition [5721 Consider an SDP solution tp : V -> Ti 
and a set M such that ips :V> (M) = 0. Replace if with tp': <p>'(u) = <p>(u) if u G M, and ip'(u) = e u 
otherwise, where {e u } u is a collection of orthogonal unit vectors, orthogonal to all vectors tp(u). The l\- 
distance between every vector ip'(u) = e u (u G V \ M) and any other vector <p'(v) is at least 1. Thus, 
H s ,<p(M) C H Si9 (V) = 0. Hence, 

\{(u,v) G En (M x M) : \\<p(u) - <p(v)\\ 2 < 5/2} | = 

= |{(u,w) G E : \\ip'{u) -<p'(v)f < 5/2}\ < 25 2 X. 

□ 

Proof of Theorem \5J\ We use Definition l5.2l in this proof. Let Ek = {{u, v) G V x V : V(u) ^ V(v)} and 
Eji C Ek be the set of random edges chosen for the set SR(V, e) as in Definition 12. 6i Since cut("P, E) C 
Er it suffices to show that the graph (V, Er) is geometrically expanding with high probability. We fix the 
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parameter 5 = 2~* (where 1 < t < T), and prove that the graph (V, En) is geometrically expanding with 
cut value X at scale 5. Then we apply the union bound for all T = log 2 D possible choices of 5. 

We use the technique developed by Kolla, Makarychev and Makarychev [29 ]. Observe that the condition 
Hs,tp(V) = implies that 

\{v eV :\\p(u) -p(v)\\ 2 <5}\ <5 2 n, 

and, consequently, 

\{(u,v) G V x V : \\(p(u) - f(v)\\ 2 < S}\ < 5 2 n 2 . 
Thus we need to bound the probability of the bad event: there exists an SDP solution ip : V — > 7~L such that 

\{(u, v) G V x V : \\<p(u) - <p(v)\\ 2 < S}\ < 5 2 n 2 (7) 

and 

\{(u,v) G E R : \\<p{u) - v{v)\\ 2 < 5 -}> 25 2 X. (8) 

We now show that if such ip exists then there exists an embedding ip' : V — > Ng to a relatively small set 
Ns CH satisfying slightly relaxed conditions: 

\{(u,v) eVxV: \\ip'(u) - ^W|| 2 < j6}\ < \ 5 2 n 2 , (9) 

and, 

|{(u,t;) G E R : y'(u) - <p'(v)\\ 2 <ls}\> h 2 X. (10) 

Here Ns C H. is a set of size exp(0(log 2 (l/(5)) depending only on 5. Then, we argue that such ip' exists 
with very small probability. 

Claim 5.4. If \Er\ < 2X and there exists p : V — > % satisfying ([7]) and (HJ), then there exists ip' : V — > Ns 
satisfying ((9]) and kffib . 

Proof. We use the following simple lemma proved in [29]. 

Lemma 5.5. (Lemma 3.7 ||29l , arXiv VERSION) For every positive Q, rj and v, there exists a set Ns of 
unit vectors of size at most 

exp(0(C 2 log(l/f7)log(l/i/))) 

such that for every set of unit vectors Z there exists a randomized mapping : Z — > N satisfying the 
following property: for every u,v G Z, 

Pr((l + C) -1 !^ - v\\ 2 - rj 2 < \\if>(u) - ip(v)\\ 2 < (1 + ()\\u - v\\ 2 + rj 2 ) > 1 - v. (11) 

The proof of Lemma 1531 is based on the Johnson-Lindenstrauss lemma: The set N is an "epsilon-net" 
in a low dimensional space. To construct ip we first project Z in a low dimensional space using the Johnson- 
Lindenstrauss transform and then "round" each vector to the closest vector in N. See [29] for details. 

We set parameters ( = 1/7, r] 2 = 5/8 and v = 5 2 /8 and pick Ns as in Lemma 1531 Then we choose a 
deterministic tp(u) : <p{V) — > N such that the condition 

7 -\\p(u) - - 5 - < \M<p(u)) - ^(p(v))f 

holds for at least a (1 — <5 2 /4) fraction of all pairs u, v G V and at least a (1 — <5 2 /4) fraction of all edges 
(u,v) G Er (the existence of such ip follows from (fTTV by the probabilistic method). Define <p'(u) = 
ip{<p{u)). We get: 
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• for all but at most 5 2 /4 n 2 pairs u, v G V, if \\<p(u) — ip{v) || 2 > 5, then — (p'(v)\\ 2 > 
7/8 S - 5/8 = 3/4 5; 

• for all but at most <5 2 /4 \E R \ < 5 2 /2 X edges («,«) G £? fl if ||^(«) - </?(u)|| 2 < <5/ 2 > tnen II - 
<^»|| 2 < 8/7 • 5/2 + 5/8 < 3/4 5. 

Therefore, inequalities © and ([Tol l hold. □ 

Observe, that E|i?fl| = sr-cost(7 ? , e) < X. Hence, by the Chernoff bound (for some absolute constant 

Ci), 

Pr(|^ fl | > 2X) < e" ClX . 

Similarly, by the Chernoff bound, inequalities © and (fTOb simultaneously hold with probability at most 
e -C 2 5 X Th us? a fi xec j (p' ; V ^ N satisfies (© and (fTOl) with probability (over random choice of Er) at 
most e~ C:i6 x . The total number of different embeddings ip' : V — > N equals |A^| n < exp(C4nlog 2 D). By 
the union bound the probability that at least one such ip' exists is at most e~ c ' iS x +C4.n\og D ^ g -n ^ ere W£ 
use that 5 2 X > Cn(log 2 D) for sufficiently large C. 

Part II follows from Part I by taking the union bound over all 2 n possible choices of the set U. We omit 
the details in this version of the paper. □ 



6 Recovering the Partitions in the Planted Model 

In the case of the Balanced Cut and Small Set Expansion problems, we can obtain better guarantees when 
the sets of the partition V = {S, T} have enough expansion within them. Note that to recover the planted 
partition, we need some conditions on the graph expansion inside G[S] and G[T]: otherwise, there may exist 
a sparse cut in G cutting both S and T (for example, if the graphs G[S] and G[T] are random G(n/2,e) 
graphs, then the graph G is a G(n,e), and thus the sets S and T are indistinguishable from other sets of 
size n/2). This assumption is in the flavor of planted instances of Balanced Cut (or Small Set Expansion 
problem), where the cut given by the partition (5, T) is much sparser (sparser by a constant factor) than 
any cut inside the (adversarial) graph restricted to S or T. (This assumption is also similar to the stability 
assumption of Balcan, Blum, and Gupta [7] for clustering problems, where a c-factor approximation to the 
partitioning problem is 7y(c)-close to the target partition.) In this case, we can find the partition (S, T) up to 
(1 + r])-accuracy for some sub-constant r\ > i.e., a partition differing from (5, T) in at most r\n vertices. 
We obtain these guarantees by repeatedly defining instances of the Sparsest Cut problem, and using our 
algorithms for the semi-random model to obtain increasingly finer approximations to the planted partition. 

Definition 6.1. Denote by h{G) the expansion of the graph G = (Vq, Eg)-' 

_ . E(S,V G \S) 

h(G) = mm — . 

ScV G \S\ 
0<|S|<i/2|V g | 

Theorem 6.2. There exists a randomized polynomial-time algorithm, a function f : N — > [0, 1] tending to 
as n — > oo, and positive absolute constants C, C exp , such that for every set of vertices V of size n, every 
partition V = {S, T}, \S\ = pn (for p G (0, 1/2] ) and every e G (0, 1), r\ G (0, 1), satisfying 

C ^/log n(log log n) 2 



V 



> 



en 

the following statement holds with probability 1 — f(n) = 1 — o(l) over a random choice of SR(V, e): For 
every G = (V, E) G SR(V, e) satisfying h(G[S]) > C exp en and h(G[T]) > C exp en, the algorithm given 
G and e, returns a partition (X, Y) ofV such that 

\XAS\ = \YAT\ < rjn or \XAT\ = \YAS\ < nn. 
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Remark 1: The conditions h(G[S]) > C exp sn and h(G[T}) > C exp en can be slightly relaxed, by requiring 
that only sets of size at least rjn expand in G[S] and G[T]. 

Remark 2: We assume that r\ < p/3. Otherwise, if p < 77, then the trivial solution (0, V) satisfies the 
conditions of the theorem. If rj G [p/3, p], we may replace r/ with 7/ = p/3 and slightly change the absolute 
constant C. 

Our algorithm relies on the Sparsest Cut algorithm for the semi -random model presented in Section l3~4l 
We denote the approximation factor of the Sparsest Cut algorithm by Csc (see [5]). We let C exp = ACsc- 
We will use this algorithm for finding approximate sparsest cuts in G[X] for various X C V satisfying 
\X fl S\, \X Pi T\ > rjn/2 (sometimes these conditions on X may be violated, then we assume that the 
algorithm returns a solution A, but the cut (A, X\A) may be arbitrarily bad). By Theorem 13 .61 the Balanced 
Cut algorithm finds a cut of sparsity at most Csc^ n with probability exponentially close to 1 unless the graph 
G does not satisfy the "strong geometric expansion" property described in Theorem 15. II part II. This happens 
with probability o(l); and in this case, the partition recovering algorithm described below fails as well. 

We introduce a potential function / that measures the quality of a partition (X, Y): 

f(X,Y) =C S cenmm(\X\,\Y\)-\E(X,Y)\. (12) 

The algorithm presented below tries to maximize / by finding non-expanding subsets A in X and moving 
them to Y and finding non-expanding subsets B in Y and moving them to X. 

Algorithm. The algorithm first finds an approximate sparsest cut (Xq, Yo) in G using the Sparsest Cut 
algorithm for semi-random graphs. Then, it repeats the following refinement procedure: find approximate 
sparsest cuts (A,X t \ A) in the graph and (B,Y t \ B) in the graph G[Y t ] using the Sparsest Cut 

algorithm for semi-random graphs and 

• iff(X t \A,Y t UA) > /(X t ,y t )+V4>moveAfromX t tor t i.e.,setX t+ i = X t \AandF t+1 = Y t UA; 
otherwise, 

• if f(X t UB, Y t \B) > f(X t ,Y t )+y±, move B from Y t to X i.e., set X t+1 = X t UB and Y t+1 = Y t \B. 

The order in which the algorithm considers the cases above does not matter. After each iteration the algorithm 
increases the counter t. The algorithm stops and outputs the cut (X t , Y t ), when neither moving A from X to 
Y, nor moving B from Y to X increases f(X, Y) by at least 

Analysis. Notice that the number of iterations of the algorithm is polynomial, since f(X, Y) is upper 
bounded by C,sc*£n 2 , lower bounded by —\E\, and at every iteration (but last) / is increased by at least 
Thus, the algorithm runs in polynomial time. To prove that the algorithm works correctly, we need to show 
that the algorithm does not stop till (X t , Y t ) is 77-close to the planted solution (S, T) i.e., till |X t A5| < r\n 
or \Y t AS\ < 7]n. 

We first prove that f(X t , Y t ) is positive for every t. The Sparsest Cut algorithm finds a cut {Xq, Yq) of 
sparsity at most Csc£ n > hence J(Xq, Yq) = Csc£^ mm (|^o|> |^o|) — l-^G^Oj ^o)| > 0. Since the sequence 
f(X t , Y t ) is increasing, f(X t , Y t ) is positive for every t. Consequently, the sparsity of every cut (X t , Y t ) is 
at most Csc^ n - 

We show that every relatively small set in G expands. 

Claim 6.3. For every set U C V of size at most 2pn/3, E(U, V\U)> C exp en\U\/2. 
Proof. Since h(G[S]) > C exp en, we have 

E(UnS,S\{UDS)) > C exp en ■ min{\U n s\,\s \ (u n S)\) 
> C exp en\UnS\/2, 
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where the second inequality follows from \UnS\< 2pn/3 < 2(\S\ - \U n S\) = 2\S \ (Un S)\. Similarly, 

E(UHT,T\ (UHT)) > C exp en\Ur\T\/2. 

Thus, E(U, V\U)> C exp en\U\/2. □ 

As a corollary, we get that \X t \ > 2pn/3 and \ Y t \ > 2pn/3 for every t (otherwise, the sparsity of the 
cut (X t , Y t ) would be large). To argue that the Sparsest Cut algorithm finds a Csc^n sparse cut in GpQ] or 
G[Yt], we need to prove the following claim. 

Claim 6.4. Suppose that the partition (Xt, Yt) is not nn close to the planted partition (S, T) i.e., \Xf/\S\ = 
\Y t AT\ > rjn and \X t AT\ = \ Y t AS\ > nn, then one of the following two statements holds: 

• \X t r\S\> nn/2 and \X t n T\ > nn/2; or 

• \Y t n S\ > r]n/2 and \Y t n T\ > rjn/2. 

Proof. The set X t is covered by S and T, and thus \X t n S\ > \X t \/2 or \X t C)T\> \X t \/2. Assume that 
\X t C\S\> \X t \/2. Then \X t n S\ > \X t \/2 > pn/3 > r\n. If also \X t C\T\> r]n/2, we are done. 

Otherwise, we have \X t n T\ < rjn/2, and \X t \ S\ = \X t n T\ < rjn/2. Consequently, 
\Y t nS\ = \X t AS\ - \X t \S\ >rjn- rpn/2 = rjn/2. Also, \Y t n T\ = \T\ - \X t C\T\> pn- rjn/2 > r\n. 

The case |TnX t | > |X t |/2 is handled similarly. (Note that we have not used in the proof that \S\ < |T|; 
we only used that \T\ > pn.) □ 

Apply Claim l6~4l and suppose without loss of generality that \X t D S\ > nn/2 and \X t n T\ > nn/2. 
Then, the set X t is partitioned in two pieces X t n S and X t nT each of size at least nn/2. Thus (as discussed 
in the beginning of the proof), the Balanced Cut algorithm finds a cut (A, X t \ A) (where \ A\ < \X t \/2) of 
sparsity at most Csc^n. We now show that the cut (A, V \ A) is large. 

Claim 6.5. Suppose that the graph G is partitioned into three non-empty sets U\, U2, U3, then one of the 
sets Ui has large expansion: for some i, 

E{U u V\Ui)>C exp en\Ui\. 

Proof. Observe that for one of the sets U it \UiC\S\ < |5|/2and \UiC\T\ < \T\/2. For this set, E(UinS,S\ 
Ui) > C exp en\Ui n S\ and E(U t nT,T\Ui)> C exp en\U t n T\. Hence, E(U i: V\U{)> C exp en\Ui\. □ 

Consider the partition of G into three sets X t n A, X t \ A and Y t . One of them has expansion C exp en. 
It cannot be the set Y t , since the expansion of Y t is at most Csc^ n - Then, 

E(X t \ A,V \ (X t \ A)) < E(X t ,Y t ) + E(X t \ A, A) 

< C SC en\X t \ + C sc en\A\ 

< 3C SC en\X t \ A\ < C exp en\X t \ A\. 

Thus the set with expansion at least C exp en is A, that is, E(A, V \ A) > C exp e\A\ n. 
Estimate the change in the potential function / after moving A from Xt to Yp. 

f(X t \ A, Y t U A) - f(X t , Y t ) > -C SC \A\en - (E(A, X t \ A) - E(A, Y t )) 

= -C S c\A\en - E(A, X t \A) + (E(A, V \ A) - E(A, X t \ A)) 
= -C SC \A\en - 2E(A, X t \A) + E(A, V \ A) 
> -C S c\A\en - 2C SC e\A\ n + 3/ 4 C exp e\A\ n + 1/4 E(A, V \ A) 
= y±E(A,V\A) > 1/4. 
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7 Second Model: Algebraic Expansion inside Partitions 



In the previous sections, we have seen that we can get much better approximation algorithms for partitioning 
problems when the edges E% crossing the boundaries of partition V satisfy some structural property (geo- 
metric expansion). In this section, we show that we can obtain good approximation algorithms for Balanced 
Cut and Small Set Expansion, when the edges Ek not crossing the partition boundaries satisfy some alge- 
braic expansion condition. This is a much weaker condition than edges of Ek being chosen independently 
at random. More crucially, in this case, the edges Ek can be arbitrary. Our algorithms are inspired by 
the results of 0|3T1, where they infer global correlations between the vectors from local correlations and 
algebraic expansion. 

Theorem 7.1. (BALANCED Cut) There is a polynomial-time algorithm, that given a graph G = (V, E) on 
n vertices with a "planted" bisection V = {P\, P2} (not known to the algorithm) of cut value em, such that 
for some subset of edges E\ C E of size \Ei\ = m, the graph G\ = (P\,E\) is a regular expander with a 
(normalized) algebraic expansion \(G\) > 64e, finds a balanced cut of sparsity 0(e). 

Proof. Consider the Balanced Cut SDP used in Section [3Tl Since this is a relaxation, the SDP value SDP < 
em. In particular, 



- sdp-cost(u — > u, Ei) = — ^2 \\u — v\\ 2 < em, 



4 

(u,v)eEi 



and, since \E\\ = m, 



^ E (u,f)GEi [11^ -^ll 2 ] = ^7 Yl W U ~ V W 2 ^ £ - 

For the regular graph G\, we have 



A(Gi) = min 



K (u,v)€E 1 [\\U ■ 



thus 



{u} uev E U) „ G p 1 



\^pA\\--n 2 ]< J ^- ) <l 4 - 



Hence, there exists u* G -Pi such that E ve p 1 \\\u* — v\\ 2 ] < 1/16. Denote d(u, v) = \\u — v\\ 2 . By Markov's 
inequality, 

|Ball d (n* ) ^)nP 1 |>^ = j. 

On the other hand, | Ball^u*, 1/4) | < 4/5 n (as shown in Section [3TTT >. 

We are ready to describe the algorithm: The algorithm guesses the vertex u* and picks a ball S = 
Ballrf^*, r) of radius r G [1/16, 1/4] around u* with the smallest edge boundary. The cost of the cut 
(5, V\S) is at most 32 • SDP, and (since Ball(u*, 1/16) C S C Ball(u*, 1/4)), 

□ 

Theorem 7.2 (Small Set Expansion). There is a randomized polynomial-time algorithm, that given a graph 
G = {V,E) with a "planted" partition V = {Pijf^} (|Pi| = pn) (not known to the algorithm) with 
E(P\, P2) < em such that for some subset of edges E\ C V of size \E\\ = m, the graph G\ = (Pi, Pi) 
is a regular expander with a (normalized) algebraic expansion X(G±) > 16e, finds a set S of size pn/A < 
\S\ < 2pn with expected cost of the cut 0(em). 



26 



Proof. Let {it}ueV(G) be the solution of the C-SDP for the Small Set Expansion considered in Section [3] 
Denote 



SDP\ Pl = sdp-cost(ii — > u, Ei) = — ^2 



2 , ::U~V\\ 2 . 



Since SDP is Pi -local relaxation of V, SDP\ Pl < OPT = em (see Lemma lATTT) . We first proceed 
similarly to the proof of Theorem 17.11 Write, 



A = rain 



E (^)e£i [\\ u ~ v \\ 2 ] 



{a}uev ^u,vePi [\W ~ v\\ 2 } 

then 



n SDP Pl OPT 1 



Hence, there is a vertex u* G Pi, such that 

E„ ePl [||u-v|| 2 ] < 1/16. 

Let d(u, v) = \\u — v\\ 2 . By the SDP spreading constraint (as shown in Section |33T ). BalLi(-u*, i) < 8/7 pn. 

Thus, for some radius r G [1/16, 1/4] (the algorithm can guess u* and r by considering all possibilities), 
the set S = Ball^u*, r) contains at least |Pi|/2 vertices from Pi, but at most 8/7 pn vertices in total. 
Furthermore, the cost of the cut E(P 1 n S, V \ S) is at most 320PT. 

The main difficulty and the main difference from the previous proof (Theorem 17 .11 1 is that the set S may 
contain vertices from P 2 and, moreover, it may cut many edges in E(P 2 (1 S,V \ S). However, we have 
already dealt with a similar problem in Section 13.31 We use the LP (from the proof of Theorem 13.41 Case 
2) to extract solution of cost at most 0(OPT) from S. The LP is feasible with LP value at most 0(OPT), 
because one integral "canonical" solution exists: it is the set S' = Pi D S. Indeed, P(Pi n S, P 2 n S) < 
E(Pi,P 2 ) = OPT, and thus E(P 1 n S, V \ (Pi n 5)) < 330PT. □ 
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Lemma A.l. The set of feasible solutions of the Crude SDP (C-SDP) given in Section [PI for the Small 
Set Expansion problem is a S -local relaxation of every partition V = {S, V \ S} (where \S\ = pn). 

Proof. Let cp = argmin^g^ sdp-cost(</?, E). Denote u = <p{u). Define a new SDP solution 



A Local SDP Relaxation for SSE 




ej_ if u € S 
u otherwise. 
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where e± is a unit vector orthogonal to all the vectors {v} vG v(G)- This solution also satisfies the ^-triangle 
inequalities, the spreading constraints (because \ S\ < pn and for all u E S, v G V\S, (v! ', «') = < (u, ?;}), 
and for all u, u 6 V, (u', v') > 0. Thus, it lies in <J>. 

Compute the cost of the new solution and compare it with the cost of the optimal solution: 



sdp-cost(n — > u' ,E) = — 



2 , „U -V II 

(n,i))G£(G) 



5 E ii«'-«'» 2 4 E 



I-/ — / 1 1 2 

2 ^ „- - „ . 2 ^ \u-vW 

(u,v)£E{G) (u,v)£E{G) 
u£Sorv£S u,veV(G)\S 



cost ]S (V,E) + - \\u-v 



2 

> i / / — r i 

2 

(u,v)£E{G) 
u,vGV(G)\8 



The cost sdp-cost(u — > u, E) of the optimal solution u equals 



sdp-cost|5(u —}u,E) + ~ j a — r| ' 

(«,«)€B(G) 
u,veV(G)\,S 

Thus, sdp-costi s (« — > u, E) < cost\s(P, E). □ 



B Remark on Theorem 3.5 



Theorem 13.5 l is stated in a slightly different form in Bansal et al. JH. We use Theorem 2.1 (part II) |[8] p. 6; 
arXiv, version 2] with p(S) = rj(S) = \S\/n and H = p. Theorem 2.1, as is, does not deal with weights 
w u , so we need to very slightly change the algorithm and proof. We add an extra SDP constraint 

Y \\u\\ 2 w u < W. 
uev 

This constraint is clearly satisfied in the integral solution. We also change the function /' (see page 10 of [8 ]). 
We let (here w u are the weights of vertices) 

f"(S) = f'(S) - x H 

32W 

new term 

= v(S) - -t^t x 



\E\ AD x SDP 

H(S) „ w(S) 

— x H x H 

\p 32W 

3\S\ E(S,V\S) p w(S) 

~\~n \E\ X 4L> x SDP ~ 32W 



x p. 



Since, 



32W ~ ^ 32W ~ ^ 32W 32 ' 
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we get (compare with the third formula on page 10 [8 1), 



E[/f ( S )>E[f]'(S)-^->?§. 

This is sufficient for analysis in JSJ. The SSE algorithm finds a set S with f"(S) > 0. This condition implies 
that 

3\S\ w{S) 



> 



An ' W 

and, consequently (as |5| = 0(pn)), w(S) < 0(W). 



x P 



C SDP relaxations 

Minimum Balanced Cut: The input to the problem is a graph G(V, E), and the objective is to find a set S 
of size n/2 with the minimum number of edges crossing it. 



mm 



subject to 



for all u,v,w £ V, 
for all u 6 V, 



\ E 



u — v 



(u,v)£E(G) 



1 ,, ,,9 n 2 

4 E II---H 2 >y 

U,VdV 

\\u — v\\ 2 + \\v — w\\ 2 > \\u — w\\ 2 



(Spreading constraint) 



(^ 2 -triangle inequalities) 
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Crude SDP (C-SDP) for Small-Set Expansion (SSE): The input to the problem is a graph G(V, E) 
and a parameter p, and the objective is to find a set S of size pn with the smallest number of edges crossing 
it. 



mm 



subject to 



for all u 6 V, 

for all u,v,w G V, 
for all u, v € V 
for all u G V, 



it — f 



Mefi(G) 



E(«)V) < pn 



W — v\\ 2 + \\v — w\\ 2 > \\u — w\\ 2 



{u,v) > 



(Spreading constraints) 



(^2-triangle inequalities) 



Minimum Multicut: The input to the problem is a graph G(V, E) and a set of k source-sink pairs 
{(si, tj)}i<i<fc, and the objective is to find a partition V of the graph with minimum number of edges across 
partitions such that for all i, V(si) ^ V'(U) . 



mm 



\ E 



\u — v\\ 



(u,v)eE(G) 
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subject to 



for all 1 < i < k, (si, U) = 
for all u,v,w 6 V, \\u — v\\ 2 + \\v — w\\ 2 > \\u — il)|| 2 (^-triangle inequalities) 

for all u e V, llull 2 = 1 
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